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Abstract 

In a random graph, counts for the number of vertices with given de- 
grees will typically be dependent. We show via a multivariate normal and 
a Poisson process approximation that, for graphs which have independent 
edges, with a possibly inhomogeneous distribution, only when the degrees 
are large can we reasonably approximate the joint counts as independent. 
The proofs are based on Stein's method and the Stein-Chen method with 
a new size-biased coupling for such inhomogeneous random graphs, and 
hence bounds on distributional distance are obtained. Finally we illustrate 
that apparent (pseudo-) power-law type behaviour can arise in such in- 
homogeneous networks despite not actually following a power-law degree 
distribution. 
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neous random graphs, power law. 

AMS Subject Classification: 60F05, 05C80, 90B15 

1 Introduction 

It has been observed in many real- world networks that, when plotting the ob- 
served number of vertices of degree k against fc, on a log-log-scale the plots tend 
to look linear. This so-called scale-free behaviour, see e.g. [7], has motivated the 
scale-free network model introduced by [T], yielding a probability distribution 
for the number of vertices of large degree which is scale-free. 

Some issues arise when trying to assess the vertex degree distribution from 
a single network. The log-log scale lends itself to over-interpretation; [TB] dis- 
cusses a good number of pitfalls when trying to test for power law using such 
plots. Moreover, [19j have shown that when sampling from a scale-free network, 
the sampled network will not in general be scale- free. In addition, the total num- 
ber of vertices in the network is fixed, and hence counts for different degrees 
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will be dependent. We shall see in this paper that whether or not the depen- 
dence is negligible depends on the size of the degrees under consideration - only 
when then degrees are large can we reasonably approximate the joint counts as 
independent. We establish these facts by proving a multivariate normal approx- 
imation, with possibly non-diagonal asymptotic covariance matrix, as well as a 
Poisson process approximation, with independent coordinates. We give bounds 
for these approximations which depend on the size of the degrees under con- 
sideration. Finally we shall illustrate that apparent (pseudo-) power-law type 
behaviour can arise in networks which are constructed using independent edges, 
and do not follow a power-law behaviour. 

The degree of a vertex is one of the fundamental summaries for random 
graphs, and hence the degree distribution is a natural object to study. In a 
general random graph on a set V oi n vertices, the degree of a vertex w, 
denoted by D{v), is defined as the number of vertices adjacent to v. The most 
basic model of a random graph is that of Bernoulli graph $^(n,p), introduced 
by Erdos & Renyi [B] . A survey of Poisson approximation for distribution of the 
k'th largest degree for large k in the Bernoulli model J#(n,p), as well as of both 
a Poisson approximation and a normal approximation for the number of vertices 
of a given degree can be found in [5] , with bounds on the distributional distance. 
For the joint distribution of degrees in the Bernoulli model $^(n,p), pjj give an 
approximation with simpler models derived from a Binomial distribution and 
use this for univariate normal approximations. 

While Bernoulli random graphs typically do not model real-world networks 
well, in [6j a mixture model for Bernoulli random graphs is shown to be suitable 
for some biological networks. Under the name stochastic block model a similar 
mixture model has proven successful in the area of social network analysis, see 
[T3]. Here we use the inhomogeneous model ^^(n, {pij}) as a sub-model of 
consisting of all graphs in which the edges occur independently, and for i,j G V 
the probability that vertices i and j are adjacent is pij. This general model 
not only includes Bernoulli random graphs, but also mixtures of Bernoulli ran- 
dom graphs, Newman-Moore- Watts-Strogatz small world networks as defined 
in |il3j . and the expontial random graph model, which is defined by assuming 
in {pij}) that pij — exp {9i + 6j)/{l + eyip [9i -f 9j)}, where {9i,i e V} are 
parameters of the model. For fairly general random graph models which include 
a Barabasi- Albert scale-free model, but do not quite cover the class {pij}) 
in full generality, [4] give a univariate mixed Poisson approximation for the num- 
ber of vertices with a given degree. There is a lack of results for multivariate 
approximations, despite the need to understand log- log plots. In addition, net- 
works consist of a finite number of vertices, and, depending on the complexity, 
the distribution of vertices with a fixed degree may be far from the asymptotic 
regime; thus bounds on the distributional approximations are required. 

In order to understand log-log plots of the number of vertices with degree 
k versus fc, we consider the degree-count sequence W := {Wi,Q < i < n — 1), 
where Wi counts the number of vertices having degree exactly i. The definitions 
of both sequences D := {D{v),v e V) and W can be related by introducing the 
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index set 



T -.^ {{v,i) -.v eV;0 <i<n-l}, (1.1) 

and defining, for {v,i) G F, the Bernoulli random variables ^(t,,i) ■— l{D{v) = 
i), where l(-) is the indicator function. Then D{v) — T.^~QiX(^y^i-j, and Wi = 
E„gvX(„ i). Other interesting statistics may be also obtained by this setting. 
For instance, one may define random variable — Y^iykWi as the number of 
vertices having degree at least k, for < fc < n — 1, and consider the sequence 
Z := (Zfc,0 < k<n-l). 

In the fiavour of probability theory, as the sequences D, W and Z are deter- 
ministic functions of the collection X :— i), {v,i) G F}, the cr-fields <t(D), 
(t(W) and cr(Z), generated by D, W and Z respectively, are all contained in 
the tr-field cr(X) generated by X. The collection X in turn can be represented 
by the point process 2 defined by 

where 5a is the point measure at a, that is, for a set B, 5a{B) = 1 if a S -B, or 
otherwise 5a{B) = 0. 

For the degree-count sequence W = (VKi,0 < i < ri — 1) in two 
results in particular have inspired the current work. [2] give univariate Pois- 
son approximations for the distribution of Wk and Zk, and [9] prove a mul- 
tivariate normal approximation for the joint distribution of any sub-sequence 
{Wdi,Wd2, ■ ■ ■ ,Wd^) of W. Both results use Stein's method; in the context 
of Poisson approximation this method is usually called the Stein- Chen method. 
The applications of Stein's method in these two papers use a coupling construc- 
tion to compute bounds on the errors made in the distributional approxima- 
tions. For graph degrees counts, for any a e F a new graph model ^^"(n,p) 
is constructed, conditional on the model '^^{n,p), such that the distribution of 
^^"(n,p) is the same as the conditional distribution of 'i^{n,p) given = 1; 
this coupling is a special case of a size-bias coupling. The difference between 
the degree-counts in ^{n,p) and in ^#"(7i,p) is then used ingeniously to give a 
bound on the distance to the target distribution. 

In Section [2] we construct such a coupling in the inhomogeneous model 
{pij}), generalizing the existing construction for the homogeneous model. 
This coupling will be the main tool for our distributional approximations, which 
we derive in Section ??. Firstly, in Theorem l3.11 we provide a multivariate nor- 
mal approximation for the joint counts of vertices with pre-described degrees. 
The bound depends on the chosen degrees, and on the heterogeneity of the 
underlying graph. The approximating normal distribution has non-diagonal co- 
variance matrix in general, and hence in the normal limit the counts will often 
not be independent. 

The multivariate normal approximation is suitable when the degrees under 
consideration are not too far away from the centre of the degree distribution. For 
large degrees, a compound Poisson approximation is more appropriate. Indeed 
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Theorem 13.61 gives a Poisson point process approximation for the M -truncated 
point process Em defined by 



where for < M < n - 1, we put Tm ■= {{v,i) : v e V; M < i < n - 1}. 

Using the invariant property of the total variation distance in functional 
transformations of point processes, we obtain, from Theorem 13.61 in Corollary 
13. 71 a multivariate compound Poisson approximation for the M-truncated degree 
sequence Dm := {D{v)l{D{v) > M),v e V) in '^{n,{pij}). The result shows 
that counts for large vertex degrees are asymptotically independent when the 
edge probabilities are not too heterogeneous. All these results also contain 
a bound in distributional distance. This bound depends on the size of the 
degrees under consideration, and on the number of vertices, as well as on the 
heterogeneity in the edge probabilities. 

We illustrate our results using simulations for a Bernoulli random graph as 
well as several classes of inhomogeneous random graphs. Finally we show that 
the log-log plots for vertex degrees can appear to be power-law like, without 
following a power law, when the edge probabilities are small. 

Proofs are postponed until Section 21 

2 A Size Biased Coupling for Vertex Degrees 

The size-biased distribution of a collection of variables X relates to a sampling 
procedure where the probability of an item to be included in the sample is 
directly proportional to its size. Formally it can be defined as follows, see for 
example, [9]. 

Definition 2.1 Let T he an arbitrary index set and let X = {Xa ; a G X} be 
a collection of non-negative random variables with means 'KXa = Aq, > 0. For 
P € I, we say that = {Xf : a e X} has the X-size biased distribution in 
the coordinate if 



for all functions G such that the expectations exist. 

A construction of (X, X''), for each /3 G X, on a joint probability space is 
called a size-biased coupling. For any subset i? C X, we set Xb — T^a^BXa, 
and \b = lE^B- [2] give the following mixture construction of a size-biased 
coupling for X in "coordinate" B: Suppose that \b < oo, and that for (3 G B, 
we have a variable X*^ which has the X-size biased distribution in coordinate 
p as in Definition 12.11 Then the random variable X^ which is obtained as 
the mixture of the distributions X'^, /3 e i? with weights X^/Xb, satisfies that 




(1.2) 



EX0G(X) = A^EG(X'3) 



EXbG(X) = AbEG(X^). 
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The application of this construction for coupling variables for the degree- 
count sequence W — (Wdi, I < i < m) has been carried out in [9], for distinct 
and fixed di (where i = 1, . . . , m). The idea is, for a given degree di and vertex 
V, we force v to have degree di. If the degree of v was equal to di in the 
first place, no adjustment is necessary. If the degree D{v) of v in the original 
graph was larger than di, then D(v) — di edges are chosen at random from the 
edges which include as one end point, and are removed. If D{v) < di, then 
di — D{v) edges of the form {u, v} are added to the graph, where the vertices u 
are chosen uniformly at random from the n — 1 — D{v) vertices not adjacent to 
V. Randomizing over the pair (v, di), for v € V and i — 1, . . . , to, then gives a 
size-biased version of the graph 'i^(n,p). 

For the inhomogeneous model ^{n,{pij}) we use the index set F in (|l.ip . 
which covers all possible combinations between vertices and their degrees, and 
we write Ai = V x {i} for i g {0, . . . , n — 1}. For i € {0, . . . , n — 1} we construct 
(X.X'^) for P G A.„ with X = : {v,i) g F} and = l{D{v) = i), as 

detailed below; we call the resulting graph ^^{n, {pij})- 

Then, we construct (W,W*) by using a random index / in Ai, which has 
the probabiHty mass function P(J = /?) = A^/Aa;, independently of all other 
random variables in the system; we call the resulting graph ^^*(n, {pij})- 

To describe the detailed construction of '^^{n, {pij}), let E denote the (po- 
tential) edge set of the graph model, and define, for an edge {a, 6} G E, the 
Bernoulli random variables ^{a,6} := l(a ^ b) in 'i^{n, {pij}) (and similarly 
-^{a b} '^^{n, {Pij})), where a ^ b denotes the event that a is adjacent to b. 
We also use the following notation: N{v) the random neighbourhood of vertex 
V in '^(n, {py }) (similarly ^'^(w) in '^^{n, {pij})), and Xi = Xi(t;) a i-set (i.e. a 
set with i elements) oiVy \ {v}. 

Construction 2.1 For each (3 = {v,i) G Ai, conditional on 'i§(n, {py}).' 



//£>(«) = d = i, let ^("^*)(n,{Kj}) = ^(n,{py}), that is, set X 
^{a,b} for all {a, b} £ E. 



If D{v) = d > i and N{v) — x^, then we choose x^ C x^ with probability 
proportional to 



'■^ T.7c^JsH\N{v)r^^^\^3, \N{v)n {VA^d)\ ^ j \ d(v) = i) 



and delete all the edges between v and the vertices in x^ \ x^. That is, with 
probability proportional to \2.1\) . we set ^j^'^} = for x in Xrf\x.i, and X^^^'^^j-^ = 
^{a.b} for all {a, b} elsewhere. 

If D{v) = d < i and N(v) — Xd, then we choose Xi D x^ with probability 



/+(x.|x,) 



(2.1) 
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proportional to 



/-(x.|xd) (2.2) 



n — 1 ^ 
j=i \j-iJ 



xP( \N{v) U x,| = J, liV(t-) U (14 \ Xrf)| = n - 1 - j + i I D(v) - i ), 

anc? arfd all the edges between v and the vertices in \ x^. That is, with 
probability proportional to we sef xj^'^j = 1 for x imii\xj^, and xj^'^j = 

■^{a,b} for all {a, b} elsewhere. 

In Section |4] we shall prove that (|2.ip and (|2.2p indeed are probabilities. 
Lemma 2.2 We have that 

X! /^(^« I Xd) = 1 and ^ /"(x,, | x^) = 1. 

Remark 2.3 Note that, in all cases, the above construction \ 2. i\ yields indeed 
that 7V(^'')(w) = Xi if N{v) = x^. 



We shall show in Section [4] that the distribution of ^^^(n, {py}) is indeed 
the same as the conditional distribution of '^{n,{pij}) given Xfj — 1, yielding 
a construction of (X,X'^) for /3 S A,;, which in turn gives a construction of 
(W,W*) via ^^*(n, {py }) using the random index I £ Ai with P(/ = /3) = 
Xfj/Xji-. In the next section, we shall use Construction [^TT] in '^{n, {pij}) to 
obtain a multivariate normal approximation for the degree-count sequence W, 
and a compound Poisson approximation for the truncated degree sequence Dm- 



3 Approximations for Degree Counts 
3.1 Multivariate Normal Approximation 

For a multivariate normal approximation we generalize the argument from [S], 
which is based on Stein's method. Let V — {1, . . . ,n}, let di,i = 1, . . . ,p, be 
distinct numbers in {0, 1, ... , n— 1}, and let Wd^ — T,"^il{D{v) — di) denote the 
number of vertices of degree di in J#(n, {pij})- Denote by W = {Wd^, . . . , Wd^,) 
the vector of degree counts. As D(v) has a Poisson-binomial distribution which 
is cumbersome to write explicitly, we abbreviate 

:= P{D{v) = d). 

Let A — (Ai, . . . , Xp) denote the expectation vector of W; for i = 1, . . . ,p, 

n n 

A, := EWd, ^J2^{D{v) = d,) = ^(Z„,d.. 

V—1 V—1 
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We also abbreviate, for v — 1, . . . , n, 

fly := ED{v) = y^Pn.i), 

u 

where, and as everywhere else, we use the convention p^^y = in ^^(n, {pij}). 
Then 

Var{D{v)) = ^Pu.vi^ - Pu.v) < IJ'v (3.1) 

u 

For vertices f2, • . . , denote by ^("1' - ''''")(7t, — m, {py }) the random 
graph ^^(n, {py}) with vertices wi, W2, . ■ . , I'm and all their edges removed. For 
this graph let Z?^"^' "'"'"-' (w) be the degree of vertex w, where w ^ {vi, . . . 
We let 

qi-u.-,-^) :^P(i?(-i---™)(u,)=d). 

It is straightforward to calculate that the entries of S = (fij)i the covariance 
matrix of W, are 

CTy = l{i = j)Xi - ^ qv,di1v,d, 

V 

When the degrees are "typical" , a multivariate normal approximation for W 
holds, as the following theorem shows. Using the notation from [9], for smooth 
functions h : R, we let Dh denote the vector of first partial derivatives of 

h, and in general D'' the fc*'' derivative of h; \\ h \\ denotes the supremum norm. 
We also abbreviate 

- 1 V- 

Pv / ,Pwv- 

n — 1 '^-^ 

w 

Define So = ((T° ) with a^j given by 

<^tj = 1(* = i)-^» - X! ^^,dSv,d, 

V 

+ yp^ypZiqv.di^i ~ qv,di){qw,dj-i - qwMj)- 

v,w 

Finally let 

M = max|^pi^,;^pi^y 

V V 

The following result gives a bound for the distance between the distribution of 
our degree count vector W to a multivariate normal distribution with the same 
mean as W, but with covariance matrix Eg- The proof can be found in Section 

m 
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Theorem 3.1 For any function h : — )■ ]R having bounded mixed partial 
derivatives up to order 3, 

i/i(I](7^''^(W- A))- Ar/i| 

<pV II 7^2;, II lY^B, + s\+^^r^ II D^h\\ (A/ + f]A,K + l)4. 



Here 

= 128!.{10 + 6d^,)M+{d^+2)Y,Pl,vm + l)\ 

and 

S = 4^p^^ + ^ ^ |p^,„ - vp^Vp^||(gu,di-i - - g«,,dj| 

-\-'^Pv\qv,d,-i - qv,d,\\qv,dj-i - qv,dj\ 

V 

as well as 

V i 

Remark 3.2 For every v = 1,. . . ,n, the degree D{v) can be approximated by 
a Poisson distribution with parameter fj,y. From 121, Equation (1.23), p. 8, 

dTv{C{D{v)),Fo{^i,))<rain(l■—\Y.pl^. (3.2) 

\ Ml-/ „ 

Here dxv denotes the total variation distance; for two probability measures ^ 
and V on the same probability space with a— algebra B, we define 

dTvip,i^) = sup \fi{B) - iy{B)\. 
BeB 

The Poisson approximation is good for example when Pu v ~ — tt for all 

n — I 

u,v and some constant c; then x c ~ 0(1)- and Yiupty ~ • distri- 

n — 1 

butional regime where the normal approximation is plausible is when all degrees 
are moderate, fiy = 0(1) for all v, so that M — 0{n); then it is reasonable to 
think of \i :x n and ql^^._i ~ 0{1) as well as q^^dt — 0(1). In this regime, with 

p fixed, ^Y^Bi = O + X! '^^^ ^ ' ''' ^ • V^^df = 0(1), this 

yields an overall bound of the order . 
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Remark 3.3 The term S arises from the variance approximation; 

I^""" ~ Vp^VKi\\iqv,d.-i - qv,d,){qw,dj-i - qw,d,)\ 

V w 

vanishes when all p^v = ctre equal. 

Remark 3.4 In the case that Pu v ^ — for all u ^ v, now putting q^ = q^ d, 

n 

the approximating covariance simplifies to 

{n - l){d^ ~ c){dj - c) _ ^ 

Under the regime that di and dj are typical degrees, so that qd^ and qd^ are 
moderate, this expression will not in general tend to zero for i ^ j as n 
oo; the covariance does in general not vanish, and the degree counts will be 
asymptotically dependent. 

Remark 3.5 While our bounds are for smooth test functions h only, they could 
be generalised to non-smooth test functions along the lines of ]15}/ . Correspond- 
ing work is in progress, f^^, and to avoid duplicate work we restrict ourselves to 
smooth test functions. 

3.2 Poisson Process Approximation for the Truncated De- 
gree Sequences in ^(n, {pij}) 

The construction 12.11 of (X, X'^) allows to assess the distribution of the M- 
truncated degree sequence Dm := {D{v)l{D{v) > M),v € V) for an arbitrary 
integer M > 0. To this end, define Fm by {l,...,n} x {M,...,n — 1}, a 
subset of r, and restrict the definitions of X and X'^ to Tm to have X^/ := 
: {v,i) e Tm} and X^ := {X^^ .^ : {v,i) € Tm}, such that £(X^) = 
£(Xm I Xp — 1), for P G Tm- Construction 12.11 can be used to derive a Poisson 
process approximation, with respect to the total variation distance, for the point 
process 5m defined in (II. 2p , where the target Poisson point process O m on F m 
has intensity Xm — i^Xa, a & Tm)- 

Theorem 3.6 In ^(n, {py}), we have 

dry (/^(Sm), Po(Am)) < 6a/.i + bM,2, 

where 

bM,i = i^i^i^) > M)f , and 

bM,2 = 2 ^ ^ ¥{D{v) > M) (u) > M - 1). 



CTy- = l{i= j)nqd^ + nqdSd, 
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Since the total variation distance between the two processes also serves as an 
upper bound of the total variation distance between deterministic functions of 
the two processes, that is, dry (/^(/(Sa/ )), >C(/(6m))) < dry('C(SA/),/:(eAf)), 
where Qm ^ Po(AAf) and / is any deterministic function, we assess the dis- 
tribution of Da/ by taking the function / on point measures ^ on Tm as 

Tl-l 

/(^) := ^ ^ i)), w G V^. In this way, the target distribution £(/(6a./)) 

1=0 

gives rise to a multivariate compound Poisson approximation for Da/, in the 
next corollary. The result justifies the independence assumption among large 
vertex degrees as used when interpreting log-log plots for vertex degrees when 
the degrees are observed not in independent graphs, but in the same graph. It 
also bounds the departure from an independent point process in terms of the 
degree threshold M. 

Corollary 3.7 In the model'^^{n, {pij}), /ei Da/ denote the M -truncated degree 
sequence {D(v)l{D{v) > M),v G V), and Ya/ denote the compound Poisson 
vector {Y^^MtV € V), in which all components are independent and Yy^M = 
T,"~^iYy^i with Yy^i ~ Po(EX(^, i)). Then, with the bM,i's from Theorem \3.b\ 

dTv{C{TiM),C{YM)) < hMS + &M,2. 



Remark 3.8 Corollary \3.T\ is consistent with Theorem 3.13 given in Bollobds, 
Janson & Riordan (2007), where, in a fairly general sub-model of 'i^{n, {pij}), 
it is shown that the distribution of an individual vertex degree converges to a 
mixed Poisson distribution. In contrast. Corollary \3.7\ not only applies in the 
multivariate case, but it also provides an explicit error bound on the distance. 

Remark 3.9 Using — Tj^^yy^y ^ypux, we obtain from (j3.2p that 

nD{v) > M) < Po{fiy){[M, n-l]} + ^ pI^^ and 

¥{D^^\u)>M-l) < Po(^(j')){[M-l,n-2]} + i^^^ ^ p^- 

2^ti x£V\{u,v} 

This yields an upper bound for the quantities in Theorem \ 3.6[ we can use the 
Poisson distribution as a guideline for a good choice of M . These probabilities 
could be further bounded using Proposition A. 2. 3 in 

3.3 Simulations for the Correlation between Counts 

We now illustrate the dependence structure in four different random graph mod- 
els, all on n = 100 vertices, with independent edges. We estimate the correla- 
tions from 10,000 samples of graphs for each model. The models are as follows. 
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1. Ml. The first model is the Bernoulh random graph with pu v = P = —■ 

, , ' , n 

This graph is at criticality; some, but not aU, realizations may yield a 
giant component, see [5]. 

1 1 

2. M2. In this model, Pu,v = - if < |u — w| {mod 100) < 10, and p^.v = 

5 80 
if |m — {mod 100) > 10. This is a modified Newman-Moore- Watts small- 
world model, see [13], with 100 vertices; two vertices at most distance 10 

away from each other are connected with probability — , and two vertices 

5 , 

more than distance k away from each other are connected with probability 
1 

80' 

minfii v] 

3. M3. Here Puv = ' — for u ^ v; the smaller of the two vertices 

. V' 

determines the probability. 

4. M4. This model is motivated by Rasch-type models; for u < v, we set 
Puv = aM(3)a«(10), with 

r i/(*v^), u<- 

au{i) = < ^ ft • 

[ i/y/n, > 2 

Figure [1] shows the correlations between the degree counts in the four models; 
except for Model M3 there is an appreciable correlation even far away from the 
diagonal. 

Figure m shows the degree count correlations, firstly between degree counts 
for k and fc -I- 1, and secondly for degree counts of an asymptotically normally 
distributed degree count and successive degree counts; the quantile-quantile 
plots are given for re-assurance. We observe a strong negative correlation for 
degree counts which are close by, but then close to zero correlation with counts 
of large degrees. 

3.4 Simulations for Power-law Type Behavior 

Using Models Ml - M4, but now with n — 1,000 vertices, we plot the number 
of vertices of degree no less than d versus d itself, on a log-log scale. Despite 
the networks being created using independent edges, the plots seem to display 
a sharp linear decline, which could easily be mis-interpreted as displaying a 
power-law behaviour. 

These simulations confirm the pseudo-power-law phenomena, and therefore 
raise the issue that, without rigorous analysis, simulation-based claims of de- 
tecting power-law type behaviour, or scale-free behaviour, could be in fact un- 
reliable and misleading. The vertex degree distribution may not be a suitable 
visual method for distinguishing different network models. In contrast, our dis- 
tributional results help assess the joint distribution of vertex degrees under a 
fairly general null model. 
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Degree Count Correlation of M1 



Degree Count Correlation of M2 




Figure 1: Degree count correlation in Models Ml - M4- 
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QQ Plot of Sample Data 
versus Standard Normal 
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Figure 3: power law; log-log scale 



4 Proofs 

In this section, we provide proofs for the size-biased couphng construction 12 . l| 
as well as for Theorem 14.31 Theorem 13.11 and Theorem 13.61 First we prove 
Lemma 12.21 

Proof 4.1 [Proof of Lemma \2.2^ We first re-write f^{'x.i | Xd) as well as 
f~{'x.i I Xrf) hy writing out the set N[v) in terms of those vertices which remain 
fixed in the construction, and those which get added or removed, respectively. 
Here x.;, x^, yj, '^(i-j) and Z(„_]^_j_|_j-) are all subsets ofVy. Figure^ illustrates 
the set relation in J[ j and respectively. We have 

/+(x.|x<i) (4.1) 
= E E E y^F(7V(^;)=y,UZ(,_,)|i?(^;)=^), 

Z(i_j) C K \ 

and 

r(x.|xd) (4.2) 

n — 1 ^ 

= E E E — j-P(iV(«)=y,nz(„_i_,+,)|i?(«) = *). 

See Lemma 4-3.2 in Lin fTJjj for more details. 
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Figure 4: The set diagram for the expression H-l^ (left) and {4-^ T'^^ff^^A 
respectively. 



Now, from since, for any and yj, there are choices of 

we have 



- P(^(')^,) E E E F(iV(.)=y,Uz,_,,). 

Z(i_j, C K \ Xd 

Let X,; — {xi : x^ C Vy}. Note that oJi G Xj i/ and onZy if uJi can he uniquely 
decomposed as uJi = yj U '^(i-j) such that yj C x^ and C \ x^. Thus, 

as required. Similarly, from J^.iip we /ind f/iaf 

Xi:XiDXd 

= P(^(,') = ,) E E E P(A^W=y.nz(„_,_,,,). 

D K \ Xd 

Note that, uii S X.; if and only if oJi can he uniquely written as oJi = yj O 
Z(„_i_j_,_j) such that yj D x^ and Z(„_i_j+j) D K \ x,^. T/i-us, 



E r{^^ I X.) = ^77^7^ E - -0 = 1, 

as required. 



J(D(v) = 

XiiXiDXd V V ; y oJiGXi 
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Proof 4.2 (Proof of Construction [27T]) The goal of this proof is to show 
that, for (3 — {v, i) £ T , 

/:(x^) = /:(x|x^ = i), (4.3) 

where X = : {v,i) € F} and "KP = {X^^^ : {v,i) € F}. Indeed we shall 

show that the distribution of the constructed model '^^ [n, {pij}) is the same as 
the conditional distribution of the original model 'S{n,{pij}) given Xp = 1, that 
is, with Xedge = {^{a,b} ■ {a,b} £ E} and Xf^^^ = {X'^^ ^^y : {a,b} G E}, we 
meed to show that for all w :— {wa.t, {a, 6} G -E) G {0, 1}'^', 

n^edgl = W) = P{X,dge = W | = 1). (4.4) 

The desired equation (j4.3p then follows because X is a function ofX^dge- 

By definition of X^y ^-^, the right-hand side of i4-4\ l zero when Ti^^v^Wy^x 7^ 
i, and by construction in that case the left-hand side of \4-4^ zero also. As- 
sume that Yix£V^Wv.x = h then the right-hand sice of |^.^[) equals 



P(Xedge = W|X(,,,) = 1) = P(Xedge = W | D{v) = i) 

= P(X{„,;,} = wa,bya ^v,b^v; N{v) ^ {x e Vy : Wy^^ = 1} | D{v) = i) 
= P(^{a,6} - Wa,b ya^v,b^ v)F{N{v) = X, I D{v) = i), (4.5) 

where the last equality follows from the independence of the edges, as the condi- 
tion D{v) = i only affects {Xs^y j,}i^ G Vy), but not edges which do not contain 
V. On the other hand, the left hand side of i4-4\ l equals 

P{X[a,b} = Wa,b for alla^v, b^ v; x'^l'^^y = Wy^^ for x e Vy) 

= nX{aM = wa^b for alla^v,b^ ^Wix[l% - m„,, for x e Vy), (4.6) 

as the construction of x|,^'^g from X^dge affects only the edges with v as one of 
its end points, and the edges are independent. Note that, in ^JTS^, 

P{x\l';^y = Wy^^yx e Vy) - P(7V("^^)(i;) = X,). 

Hence to conclude that ^.6\ l equals ^4-5\ l, it remains to show that 

P(A^("^*)(u) = X,;) = PiNiv) = X, I D{v) = i). (4.7) 

Indeed, 

F{N^''''\v) =x,) 

= qy,,P{N{v)=^,\D{v)=i) (4.8) 

+ J2 J2 = ^^dlDiv) = d)F{N'^"''\v) = x,;|iV(t;) = x^), 

d:d<i XrfiXtjCXi 
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where the three terms corresponding to the coupling construction (|2.ip . Now, 
we calculate the sums over : D x^ and over x^ : x^ C x^ separately. In 
fact, for the first case d > i and x^ D x^, it follows from and J^.Jp that 

¥iN{v) =^d\D{v)= d)P(Ar(''^*) (v) - X, I N{v) = x^) (4.9) 

Xd D X. y^. c X. C y„ \Xd 

P(A^(t;) ^ JCdMNiv) ^ y, U Z(,_,)) 
^ P(D(w) = d)P(L>(w) = i) 



where 



nN{v) = xrf) 



n ^^'^) ( n -Pt>x) 

xey„\xd 



^mce Yj C Xj C Xd anrf Z(i-j) C K \ x^, we /laue Vy \ (y^ U Z(j_j)) D x^ \ x^ 
('see Figure^ for reference). Therefore, 

V{N{v) = x<i)P(7V(i;) - U Z(,_,)) 



2:exd\xi 



n (i-p-)) 

2;Gl'„\[yjUZ(._j)U(xd\xi)] 



n n p--){ 

= P(iV(?;) = xOP(Af (i^) = yj U Z(,_,) U (xd \ X,)) . 
Hence, from we /laue /or d > i </iai 

^ P{N{v) = Xrf I = d)P(Ar("^*) («) = X, I N{v) - Xd) 



Xd:XdDXi 

P{N{v) = X, 



P(i^(i;) = P(i?(«) = d) (^-j) 
EE E P(iV(w) = y, U Z(,_,) U (xd \ X,)) 



(4.10) 



Xd 3 X, c X. C V„ \ Xd 

/Since Yj Uz(i_j) is an i-sei (i.e. a set with i elements), (xd\xi) is a {d—i)-set, 
and {yj U Z(i-j)) H (x^ \ x,) = 0, we have that y-j U Z(i_j) U (x^ \ x.;) is a d-set 
in Xrf. Conversely, for any d-set u)d G X^, since x^ is fixed, we can decompose 
uJd as 



u^d = Yj U Z(j_j) U (xd \ Xj), 



(4.11) 
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such that Yj C x^, D and C 14 \ x^. Referring to Figure^ it is 

easy to show that for any u)d G ^d, there are ("^ solutions {yj,'kd,i{i^j)) 

to decompose iVd cis ^TTTP (^^^ Lemma 4-3.4 Lin (2008) for more details). 
Thus, it follows that 

il E y^P(iV(^')=y,Uz(,_,)U(x,\x,)) 

Xd D X, y^. c X. C V„ \ Xd 

- nN{v)^LJa) = F{D{v)=d), 

and from H^-IO^ , we have for d > i that 

J2 nN{v) ^^a\D{v)^ d)P{N^^^'^ (v) = X, |iV(z;) = x^) 



xjiXdDXi 

FiNiv) = X,) 
F{D{v) = i) ■ 

The case d < i and x^ C Xi is treated similarly, giving 

nN{v) = ^d\D{v) = d)¥{N<^''''Hv) = X, I N{v) = x^) 

XdiXdCXi 

F{N{v) ^ X,) 
F{D{v) = i) ■ 

See /Tiy /or details. Combining 14. 8\ ), 14.1^ and I4.1S\ >, 

P(7V("-^)(z;)=x,) = ffl^I^=P(7V(t;)=x,|i?(«)^*) 



(4.12) 



(4.13) 



as 



required in ?| ) complete the proof. 



The proof of Theorem l3.1l is based on the following theorem, which is similar 
to Theorem 1.2 in [S] but gives a multivariate normal approximation with respect 
to an alternative covariance matrix Eq for which ||Eo|| is straightforward to 
bound and which is close to E. We use the notation from [Qj. For a vector 

b g we let || b ||= max \bi\. More generally, for an array A = (oi.,), the 

l<i<p 

notation || • || is its maximal absolute value. For an array ^(w) = {ai(w)} of 
functions, || A \\~ supmax |aj(w)|. 

w * 

Theorem 4.3 Let W = (Wi, I < i < p) be a random vector in with non- 
negative components. Let A = (Ai, . . . , Xp) ~ EW and assume that Var(W) = 
E = (tJy ) exists. Let Sq = (cr° ) be a positive definite (invertible) p x p matrix. 
For each i — 1, . . . ,p let (W, W) be a random vector defined on a joint proba- 
bility space with having the W-size biased distribution in the coordinate. 
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Let h : JM' ^ R G Cjf , and let Nh = E/i(Z) where Z denotes a standard normal 
variable in . Then 

E/i(Eo^''^(W- A)) - Nh\ 



n2 



i=i j=i 

i=l j=l k=l 

t 

2 



i=i j=i 

Proof 4.4 (Proof of Theorem I4.3p The proof follows closely the lines of the 
proof of Theorem 1.2 in J^. The only difference is that instead of their decom- 
position (18), we use 



E |/i(E(7^''^(W - A)) - A^/ij 



(4.14) 

-M tK-4)^/(W)J (4.15) 
-e| HW]-W,){Wl-Wk) (4.16) 

r/ie bound for 1^4- for \4.16^ are as in [9^; and for J^. we obtain the 
bound 

i=l j=l 

This completes the proof. 

Proof 4.5 (Proof of Theorem 13. ip The proof is based on Theorem \4.^ and 
the size-biased coupling construction ] 2. 1\ Denote the randomly picked vertex by 
V. First note that, for all i ^ 1, . . . ,p, \W^^ - Wd^ \ < \D{V) - di\ -\- I, because 
at most \D{V) — di\ edges are added or removed; and the degree ofV is fixed to 
equal di in the size-biased distribution. Hence 

iW^;-Wd,)iW^:~W,,)\ < E{D{V)+d, + lf 

< 2{d, + l)^ + 2ED{Vf. 
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Now for di the vertex V ^ v is chosen proportional to qv.di', hence 

1 " 

¥.D{Vf = —y^q,,d,{VarD{v) + iED{v)f), 
and we use (13.11) to bound the variance. 



The bound on yVarE[W^^ ~~Wdj | W] is straightforward and follows the 
lines of First note that 

Var^W^l - Wd, I W] < VarE[W^^^ - Wd, \ ^(n, {py})]. 

With the notation ()2.ip and p.2p . we abbreviate 

= E HNiv)^^d){l{Diu)=d, + l)-l{Diu)^d,)} 

X E /^(xd, |xd) 



and 



(3{u,v) (4.17) 
^E E HN{v)=^d){HDiu)=d,-l)-MD{u)=d,)} 



f (XdJXd). 

We note that 



X 

Xd,Dx<j;tiex<j 



\a{u,v)\<^ and \P{u,v)\<^. (4.18) 



Then 



E[W^;-Wd,\':^{n,{p,,})] 

= ^^l(u-f)a(u,w)+^^l(u7^w)/3(u,«) 
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This gives that 

VarE[w;l^ - Wd, | W] 
< 4iyar^^l(u-w)a(w,u) (4.19) 

I \ ^ u^v j 

+Var\^'^\{ur^ v)i5{u,v)\ (4.20) 
+Var (^^ Y,<lv,dMD{v) ^ d,)l{i - j)j (4.21) 

+^«^(^E9-.'^'1(^(")='^^)1(*^J))} ■ (4.22) 

Firstly, for (j4.19p . with (j4.18p . and u' ^ u,v as well as v' ^ u, v, 

2 9^ d 



A? ' 

\Cov{\{u ^ v)a{u, v); l{u' ^ v)a{u' , u)) | < 2pu,vPu',v ^' 



|Coi;(l(u ~ v)a{u, v); l{u ^ v')a{u, u')) | < 2pu.vPu,v' 



A? ' 

(lv,di<lv' ,di 

A? ■ 



For u' ^ u,v and v' ^ u, u' , v, let C = C{u, u' , v, v') — \{u 9^ u', u 7^ u', v 7^ 
v' , u' ^ v). Then 

f{C{u,u\v,v') = l) > I- {Pu,u' +Pu.v' +Pv.v' +Pu',v)- (4.23) 

Put X{u, v) = E1(m ^ v)a{u, v); then for u, u' , v, v' mutually different, 

Cov{Tl{u ^ v)a{u, v), l{u' ^ v')a{u' , u')) 

= E[(1(u ~ v)a{u,v) - X{u,v)){l{u' - v')a{u' ,v') - X{u\v'))\C = l] 
P(C = 1) 

— A(u,w))(l(u' ~ v')a{u' ,v') — A(m',w'))|C — O] 

P(C = 0). 

As l{u ^ v)\a{u,v)\ < l{u ^ u)^Y~^ by (|4.18p and as the edge indicators are 

Xi 

independent, we can bound 

E[(l(u - v)a{u, v) - A(it, w))(1(m' - v')a{u , v') - A(u', i;')) |C = O] 
P(C = 0) 

— 4 —2 Pu,vPu' ^v' {Pu,u' ~^ Pu,v' Pv,v' '^Pu'^v^- 
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As a{u,v) is a random variable which depends only on {l{u ^ u), l(w; ^ 
u),l{'W ~ v),w ^ u,v}, it follows that conditional on C = 1, a{u,v) and 
a{u',v') are independent. Moreover, 

E[l{u ~ v)a{u,v) - X{u,v)\C = l] 

= E E f^i-^^i-'^) 

d>di x<i:«ex<i xrf^ Cxd;«^X(j^ 

x^[l{N{v)=^d){l{D{u)=dj + l)-l{D{u) = dj)]\C =1] 
-E[l(iV(w) = Xd) {l{D{u) = d, + 1) - l{D{u) = d,)}] }• 
Re-grouping the terms and conditioning on u ^ v give that 

¥.[l{N{v) = Xd)l(M - v)l{D{ii) ^ dj + 1)|C = 1] 
-E[l{N{v) = Xd)Hu v)l{D{u) = dj + 1)] 

= [P(A^("'"''"'H«) = Xd \ {u}) - P(iV(")(z;) = Xd \ {u})] 

_P(iV(")(u) = Xd \ {u}) [P(£»(")(m) = dj) - P(£»("' = dj)] }. 

#ow, conditioning on whether or not u' v and v' v gives that 

El(iV("'">')(t;) = Xd \ {u}) - El(iV(«)(t;) = x^ \ {u}) 

< (Pu'.v+Pv',v) {P(A^("^"''"')(^') = Xd \ M) 
+P(^^("'"''"'H^') = Xd\K^/'}) 

+P(iV("'">') (v) = Xd \ {m, v'}) + ¥{N^"'^'^^'\v) = Xd \ {u, u\ v'})} . 

Similarly, P(£>('')(u) = dj) - = dj) < Pu,u' + Pu,v'- Hence 

E[l{N{v) = Xd)l(M - v)l{D{u) = + 1)|C = l] 
-E[l(Af(t;) = Xd)l(u ~ = dj + 1)] 

< {(P„',. +P.',.)P(iV^"'"'''''n^^) e A) 

= Xd \ {«})} , 

where A = {xd \ {u}, Xd \ {u, w'}, Xd \ {m, i"'}, Xd \ {u, u', v'}}. Combining these 
bounds, 

A(u, w)) - v')a{u\ v') - X{u', v'))\C = l] 



^ Q^Pu^vPu' ^v' iPu^u' ~^ Pu^v' ~^ Pu' ,v ~^ Pv' 



2 QvjdiQv' ,di 
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A similar argument holds for (|4.20p . involving l(u 7^ v)(3(u,v); recall (j4.17p . 
Firstly, 

Var{l{u v)p{u,v)) 

'iv.d 



< ^ijl^Ef^ ^ l{Niv)=^,){liDiu)=d,-l)-MD{u)=d,)} 



d<di x^-.u^Xd 

X l(u ^ Xd) ^ /"(xd, |xd) 



W^e bound the probability that vertex u is picked to be added to the neighbours 
ofv, ifN{v) = Xd; 

F{u picked] N{v)^Xd) = l(u^Xd) /"(Xdjxd) (4.24) 

(«) 

< Pu/-^^. (4.25) 

Mi/i (|4.25p we obtain that 

J2J2^ar{l{u^v)(3{u,v)) < ^^p.^J^^j . (4.26) 



Similarly as above, we obtain 

EE E Cov{l{uTl^v)P{u,v),liu' ^v)f3{u',v)) 

^ (4.27) 



and 



Cov{liu7^v)P{u,v),liu7Cv')Piu,v')) 
^ 2EE E Pu,.Puy'^^^^^^^0^. (4.28) 



Now assume that u,v,u' and v' are all distinct. We refine the definition of 
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P; fort =1,2, define p^*^ by 

X 



d<di Xd-u^Xd 

p(^){u,v) = E HNiv)=^d;Diu) = d,) 

d<di x^-.u^Xd 

E /"(xdjxd). 

Xdj Dx<i;M6x<jj 

+(1 - p„,,)P(£>('')(w) = d,- - l)P(Ar(")(t;) = Xd)} 
P(u picked] N{v) = x^) 

and 

= E E nu picked] N{v)=^,) 

' d<di d' <di Xd-.u^xd y d' -u^y di 

X P(?i' picked I iV(w') = Yd/) 

xP(7V(^;) = xd, 7V(^;') = Yd', = dj - 1, D{u') = dj - 1). 
Moreover, when conditioning on C = 1, the independence of the edges gives 
W{N{v) = Kd,N{v') = yd',D{u) = d, - l,D{u') = - l|C = l) 

= p(7v(«^«''"') {v) = Xd)P(iV("^^'"') («') = Yd') 

p(d('''"''''')(u) = dj - i)P(i:)("'"^^')(u') = dj - 1). 

Conditioning on whether or not u ^ v and u ^ v' yields 
Cov{l{u 7^ v)p'^^\u,v), l{u' 7^ v')l3^^\u' ,v')) 

= s^-^Y.T. E E nu picked] N{v) = ^,) 

* d<di d' <di Xdiui^Xdyd'-u^ydi 

xF{u' picked] N{v') = y^') 

|p(7V("'"''''')(w) = Xd)P(A^("^^'"')(i;') = ya')F{D^'''"'^'''Hu) = dj - 1) 
xP(D("''''"')(u') dj - 1) - P(i:'(^)(u) = dj - l)P(7V(")(i;) = Xd) 
x¥{D^^'\u') = dj - l)P(7V("')(t;') = yd')} + +i?2, 
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where from (I4.25P we immediately get 



\Ri\ < i-na-x{pu,v,Pu',v'}Pu,vPu'.v' 



(u) (u') 
% ,di-l%' Ai-l 



and, with (|4?23l) . 



= {Pu,u' +Pu,v' +Pv,v' +Pu',v)Pu,vPu',v' 



(u) («') 

%,di-l1v' .dj-l 

A? 



Again conditioning on the presence 0/ edges, we obtain 
(7v(-"'.-')(„) = xrf) - P(7V(")(t;) - Xd) 
< l({u',t;'}nx<i ^0)P(7V(")(«) -Xd) 

+p„.,„P(A^("'"'''^')(w) = Xd I u' ~ 1-, w' - t;)} . 

We also have that 

V[D'^''\u)=d,^l-Ur/.u',Ur/.v')-nD^^\u)=d,^l)\ < Pu^u'+Pu,v'- 

With similar bounds for the other two terms we note that the sums over Xd and 
Yd' still include a term of the form P(A^("^" \v) = x^;?/' '/^ v\v' ^ v) or 
similar, so that these sums can be bounded by 1. Moreover, 

v'} n xd ^ 0)P(7v(")(w) = xd) < p„,,„ + 

We conclude that 

Cov{l{u tC v)/3(i) (u, w), 1{u' tC {u\ v')) 

— (yPu,v Pu' .v' ~t~" Pu.u' Pu^v' ~(~ Pv^v' ~(~ Pu' .v)Pu,vPu' ,v 



(n) («') 

A? 



(rf? + 2). 

/n f/ie same iwa?/ we can bound |Coi;(1(m 7^ v)l3'^^\u,v), l{u' 7^ i;')/?'^^ ^^')) 
T/iMS we obtain that 

EE E E Cov{l{u^v)l3{u,v),l{u' ^v')P{u',v')) 



< 4(d? + 2)EE E E 



(«) («') 

%,di-l%' ,di-l 

A? 



X {pu,v +Pii',v' +Pu,u' +Pu,v' +Pv,v' + Pu' ,v)Pu,vPu' .V' ■ (4.29) 
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For (j4.2ip use the hound 



A- A 



Using that 

ICi-9-'i.l < P(i?^^^M^i^(?;))=p„.„, (4.30) 
we obtain for (I4.2ip f/iaf, w/ien i = 



Var 



2 ^ ^ 

- T2^^1^■.d^(lu,diPu,v, (4.31) 



V u 



where we use the cojiveiitioTi py_y = 1. Similarly, for (|4.22p . when i ^ j, 

l^ar I !(£>(«)= dj) j < ^ ^ ^ gu,d.9u,<i,Pu,D- (4.32) 

Combining the bounds for wii/i (|i3T|) . (g^H), g^T]), (li?^ . anrf g^H]), 

(|4.30p anc? (|4.32p . and using crude bounds such as Qv.di < I, we obtain that 

VarE[W^;-Wd,\W] < B^/X^ 

with Bi given in the statement of Theorem \3.1\ 

The next step is bounding \\ Sq ^^'^ \\. Define h^{i) — y^QvAt , md let by = 
{by{l), . . . ,by{p))^ and Dy = diag{by{i)). Further let a denote the p x 1 vec- 
tor with entries aj = \/Ww{lw,dj~i — <lw,dj)- This gives Sq = '^^Dy{Ip — 

W V 

hyh^)Dy + aa"^, where Ip is the p x p identity matrix. For any matrix A, let 
Pi(^) < • • • ^ Ppi-^) denote the eigenvalues of A in increasing order. By Weyl's 
Theorem ( UO^ . Theorem 4-3.1), 

pi(So) > ^pi(i:'^,B„D„) +pi(aa'^) > ^pl(A,B^,i?^,). 

V V 

Letting By — Ip — hyh^ , it can be shown that the eigenvalues of By are 1, with 
multiplicity p ~ 1, and pi(By) = 1 — b^b„ the least eigenvalue corresponding to 
the eigenvector b„ . Now, using the Rayleigh-Ritz characterization of eigenvalues 
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(fW^, Theorem 4.2.2), 



Pl[DyByDy) = Him = =111111- 



y y^D^^y 
. Pi{Bv) . n ^ 
Pp{Dv ) * V 



It therefore follows that 



iSo^/^ll < Pp(S-^/^)= _ (4.33) 

Pi(So ) 



Finally we bound ^ |cr,., - o-y |. Mi/i ()4.30p . 



-1/2 

=: r. 



.0 I 



v,w 

— VPvVPw{qv,di-l — qv,di){<lw,dj-l — IwAj)^ 

< 4^p^„+ ^ \pwv - VP^VP^\\iQv,di-i - qv,di)iqw,d,-i - qw,dj)\ 
+ Y^Pv\qvA,~i - qvAMqv,dj~i - qv,dj\ =■ S. 

V 

Collecting the bounds and using that q^^di 5: 1 gives the result. 
Proof 4.6 (Proof of Theorem [3T6)) Recall that 

= EX(„^,) = ¥{D{v) = i). 
Based on Theorem 10. B in Barbour, Hoist & Janson (1992), we obtain that 

dTv{C{-^M).Po{XM)) 

n— 1 ( n—1 n—1 

n—1 n—1 ^ 

+EE^-E Eki^k.)-^(:;2i • 

veV i=M ueVv j=M ) 

Note that, the first summand in the bracket is for the case u — v for j ^ i, and 
the second summand in the bracket is for u e for all j, covering all {u,j) 
except the case {u,j) — {v,i). Since ^(^jj — for j ^ i and EX(„ = qv,j, 
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the first summand in the bracket is equal to Tiy^v'^i'=Ml'",i^^=M j^il^-j ■ This, 
together with the summand outside the bracket, yields T,y^v[^'i=MQv,if ■ 

Let 5{u,v,i,j) — E|X(„ — x'"i\^^j^\ for u e V^, then the construction (|2.ip 
gives 

S{u,v,i,j) 

= E I-''' E = ^^dlDiv) - rf)E[l^(«,,) - ^(;;,f)l I N{v) = xd 

d:d>i x^iCVi; 

+ ^ ^ P(A^(«) = Xd|I?(i;) = d) 

d:d<i XfiCZVv 

E[l^K.)-<f)l|^(-)=x. . 

Let Si{u,v,i, j) and S2{u,v,i, j) denote the above two terms respectively. We 
first calculate Si(u,v,i,j). Note that, if D{v) = d> i and N(v) = x^;, then by 
construction \2. 1[ — = 1 if and only if vertex u satisfies (1) u Cz x^;, 

(2) D{u) = .7 + 1 or j in 'i^{n, {pij}) and (3) u v in '^''^'^\n,{pij}), namely 
u is not adjacent to v in the graph. Hence, di{u,v,i, j) can be written as 

Si{u,v,i,j) = Y.1-'d nNiv)^Xd\Div)^d) 

d:d>i Xc^CV^,:^iGXc^ 



•E 



=j + l or ^ v} N{v) ^ x^ 



Conditioning on the event "D{u) = J + 1 or j", since the neighbourhood of 
vertex v does not contain any relevant information for the degree of vertex u 
once we know whether or not u ^ v, we have 



6i{u,v,i,j) 

<lv,d J2 ^(^(") = I D{v) = d) 



(4.34) 



■F{D{u) - J + 1 or j I X{„,„} = 1) • P(X|;;;;\ = I N{v) = x^). 

Ifd>iandue x^, then P(x|^''„-'j = | N{v) = x^) is the probability that 
the edge {u,v} is deleted in the construction \2.l\ and hence, by l\2.1\ . 



P(X|:;:\=0|7V(^;)=X,) 



For \4^.34% we then obtain 
6i{u,v,i,j) -- 



XiCZVv :Xi Cxd ,ii^Xi 

E /+(x»|xd). (4.35) 

x-i C Vi, :xi CXrf jU^Xi 



EE E /"(-^i-^) 

d:d>i XdCyt>:«exd Xi C V„ ix; Cxj ,ti^Xi 

■nN{v) = ^d)nD{u) =j + lorj\ = 1). (4.36) 
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Next, we find S2{u,v,i, j) by a similar argument as for Si{u,v,i,j). If 
D{v) — d < i and N{v) — x^, then by construction ] 2. 11 \X(^u,f) ^ ■^{uj') \ ^ ^ */ 
and only if vertex u satisfies (1) u ^ Xd, (2) D{u) — j — 1 or j in '^{n, {pij}) 
and (3) u ^ v in 'i^^^'^\n,{pij}). Hence, following the same argument as for 
6i{u,v,i, j), we arrive at 

h{u,v,i,j) 

d:d<i XdCVi,:M^Xd Xi C Vf :Xi Dxd ,tiGXi 

■F{N{v) = Xd)P(-D(u) = .7 - 1 orj \ = 0). (4.37) 

Now, the sum of at !i4.36\ l Ii4-37\ l gives S{u,v,i, j), and hence 

drvi^C^M), Po{\m)) < bM,l + &Af,2 + bM,3, 



where 



JM.2 



n — 1 n — 1 

EEEEE E E f^(-^\-^) 

veV i=M ueVi, j=M d:d>i x^CV^-.uexa x,- C K :xi CXd ,u^Xi 

■¥iDiv) = i)FiN{v) = Xd)Pp(M) = j + 1 orj \ = 1), 



and 



n— 1 n— 1 



^M,3 = EEEEE E E .r(-^i-^) 

veV i=M ueV^ j=M d:d<i xaCV^-.u^Xa Xi C V„ :Xi DXd .tiGXi 

•P(I?(t;) = t)P{N{v) = XdMDiu) - .7 - 1 orj | = 0). 

/n particular, removing the constraints of vertex u on choosing x^ and x^, 
and using Lemma \2.2[ we obtain that 



ri-l 

bM,2 < 2 ^ E E ^(^(^) > ^ I ^{"-} = 1) 

f ev uGK i=M 

< 2 ^ ^ P(7?(«) > M) P(i^(u) > M I = 1) 

and similarly bM^z ^ 2 ^ ^ P(i:)(w) > M)f{D{u) > Af - 1 1 X{u^^} = 0). 

Afow, wif/i both ¥{D{u) > M \ 1) and F{D{u) > M - 1 1 X^^.v} = 0) 

egita/ to P{D'-^\u) > M — 1), the assertion follows. 
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